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Life is a non-equilibrium process involving the handling of information, both internal 
(about the organism) and external (about the environment). It comes therefore 
as no surprise that the mechanisms of information processing in living beings are 
highly sophisticated. About four decades ago it was recognized by both Jacques 
Ninio [1] and J.J. Hopheld [2] that nucleic acid transcription and translation could 
achieve unexpected hdelity by inserting a strongly irreversible step in the process, 
a mechanism that was given the name of kinetic proofreading. The necessity of 
an irreversible step in error correction is in agreement with the principle stated by 
Rolf Landauer [3], that any logically irreversible manipulation of information must 
be accompanied by a corresponding entropy increase either in the system itself or 
in the environment (cf. [1]). The validity of this principle far from equilibrium has 
been recently conhrmed by M. Esposito and C. van den Broeck [5] on the basis of 
stochastic thermodynamics. Hopheld and collaborators were able to collect evidence of 
kinetic proofreading mechanisms in tRNA aminoacylation A different scheme in 

which high hdelity can be achieved in these fundamental molecular processes was later 
suggested by C. Bennett [8]. One may therefore argue that one of the fundamental 
requirements of living processes—the need for an enhanced hdelity—necessarily implies 
that these processes take place far from thermodynamical equilibrium. The subtle 
thermodynamics involved in copying a biopolymer on a template has been more recently 
investigated by D. Andrieux and P. Gaspard p cni [m as well as by J. R. Arias- 
Gonzalez [12]. Detailed kinetic models of the DNA transcription process have been 
analyzed, from the point of view of both their thermodynamic efficiency and their 
hdelity, by M. Voliotis and collaborators PI- A more general model, involving 
several reaction loops, has been thoroughly investigated by A. Murugan, D. Huse 
and S. Leibler PI- This work has pointed out that the Hopheld model can work 
with high accuracy and speed in a special regime which had remained unnoticed in the 
original work. This analysis has led to identify some rather general design principle for 
proofreading networks and to point out that a network with a given architecture (with 
given reaction constants) can work in several regimes, switching from one to another 
by just changing the distance from equilibrium (e.g., by manipulating the ATP/ADP 
imbalance) |15] . 

Our understanding of the features which determine the performance of proofreading 
networks has been improved by the work of P. Sartori and S. Pigolotti [16]. These 
authors emphasize the difference between energetic and kinetic discrimination regimes, 
according to the mechanism which leads the enzyme to discriminate between substrates. 
Energetic discrimination relies on a difference in substrate-enzyme affinity, while 
kinetic discrimination relies on a difference in the base reaction rate. In the simple 
copolymerization scheme introduced by G. Bennett in [8], the former becomes more 
precise when the reaction takes place closer to equilibrium, while the latter becomes 
more precise when the reaction takes place faster. Thus the difference between the 
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Ninio and Hopfield scheme on the one side, and the Bennett one on the other side, 
appears to be mainly due to the fact that the hrst one works in the energetic regime, 
and the other in the kinetic one. 

The insights gained by recent progress in non-equilibrium statistical physics and 
stochastic thermodynamics on the thermodynamics of information have been recently 
reviewed in HZ]. In [18] the thermodynamics of information has been exploited to set 
limits to the amount of information that can be stored in the memory of a stochastic 
measurement device given an energy budget. An efficiency measure for information 
handling in cellular sensing was proposed in [19] , and the analogy between sensing and 
kinetic proofreading has been pointed out and discussed in [20] . 

In the present work we introduce a thermodynamic description for enzyme-assisted 
assembly processes embedding kinetic proofreading. We thus reconsider a number of 
simple proofreading schemes in the light of these recent developments, and especially 
of the classihcation of discrimination regimes proposed by Sartori and Pigolotti. We 
are able to introduce a measure of the efficiency of the process, which is related to the 
entropy production, and discuss how the error rate, the reaction speed and the efficiency 
vary, for a given architecture, as a function of one another. We also consider in a simple 
setting the effects of error correlations in polymer synthesis on a template. 

The general reaction scheme for enzyme-assisted assembly with proofreading is 
described in sec. O In the same section we provide the kinetic and thermodynamic 
description, and all the relevant quantities are dehned. The simplest reaction scheme, 
the Michaelis-Menten (MM) model (without proofreading), is dehned and discussed in 
sec. [3l The ground-breaking model introduced by Ninio and Hopheld is discussed in 
sec. m The more general scheme introduced by Murugan, Huse and Leibler is taken 
into account in sec. |5l In all these models, the enzyme assisted reaction is treated 
as a renewal process, in that when the product is released, the enzyme goes back to 
its neutral, “bare”, state. In order to make a hrst step in understanding the role of 
correlations in this kind of process, we introduce and discuss in sec. [6] a simple model in 
which the ehect of a correct or wrong incorporation ahects the rate constant of the next 
reaction. We close the work with some conclusions and hints for further investigations. 

2. Description 

Enzyme-catalyzed chemical reaction networks involving competing substrates represent 
the typical scheme appearing in transcription processes. In this framework, the enzyme 
catalyzes the transcription reactions for one of the competing substrates, e.g., for one 
activated nucleotide in DNA replication or transcription. More generally, one can 
consider an enzyme-catalyzed assembly reaction like, e.g., tRNA aminoacylation, in 
which the tRNA-enzyme complex captures an activated amino acid in the cytoplasm 
and either binds it to the tRNA or rejects it. However, only one of the substrate 
carries on the correct information and leads to the correct hnal product. The question 
which thus arises is how the enzyme can reduce the rate of errors for a given energy 
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Figure 1. General scheme of an enzyme-assisted assembly process. 


consumption. 

The general scheme of an enzyme-catalyzed assembly is described in hg. [H The 
enzyme, in its free state &, reacts with one of the competing substrates Sj, thus forming 
an activated complex. The chemical enzyme-substrate complex then undergoes a series 
of chemical reactions, which represent the intermediate stages needed to complete the 
assembly process. During these stages the enzyme performs the proofreading, i.e., it 
can dissociate from the substrate by means of the proofreading pathway. This is more 
likely to happen if the substrate is the wrong one. Otherwise, the catalysis transition 
terminates the step, leading the system to the hnal product state Si (corresponding 
to the substrate Sj), and restoring the free enzyme state &. Let us remark that the 
intermediate reactions could also involve further chemical species, denoted in £g. [1] 
by {aj} and {bj}. They can either be part of the hnal product state or provide the 
system with the chemical energy needed to carry on the processes (one example is ATP 
hydrolysis m p. 80-87]). In our framework, we treat these species as chemicals whose 
concentrations are kept constant by the environment. This assumption simplihes the 
description, in that the only species whose concentration varies in time is the enzyme- 
substrate complex. Also, the chemical network can thus be correctly represented by a 
graph—the proper way to represent generic chemical networks is by means of hyper¬ 
graphs [22l [23]. In this way, the nodes of the graph represent the metastable states of 
the complex, and the reactions the oriented arcs which connect these states pTl |25] . 

We shall call internal all the reaction steps of the process (including the 
proofreading one), except the hnal catalysis step. Internal reactions are assumed to 
be reversible and their rates can be expressed in the Kramers form: 

k = Qe-^, ( 1 ) 

where D is a constant characterizing the reaction pathway (i.e., it is equal for the 
inverse reaction), while A takes into account both the reaction activation energy and 
the concentrations of the external chemical species. (We assume that all reactions take 
place in an environment with a constant absolute temperature T, and we set k^T = 1 
from now on.) In this way, the rates determine the non-equilibrium conditions and local 
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detailed balance is preserved. 

The final catalysis transition terminates the assembly process. Biologically, it 
corresponds to the stage beyond of which the enzyme does not process that substrate 
anymore. Thus, the final product states Sj can be regarded as absorption states of the 
process, where the free-enzyme state is restored and a new assembly process can start 
again. We can thus describe the occurrence of many instances of the process, one after 
the other, by disregarding their final product state and by considering this reaction as 
leading irreversibly to the free-enzyme state (cf. [211 § 8])- 

This description is tacitly based on the assumption that the steps are similar and 
independent, and that there is no correlation between successive steps. This is not 
always the case. For example, in transcription processes involving a template strand, 
like DNA replication and RNA transcription, the secondary structure of the strands 
imposes specific correlations between the monomers [26]. These correlations, in turn, 
imply that the strand configuration plays a (fundamental) role in both the individuation 
and the proofreading of errors 1271 . 

We can take into account these correlations in two ways: 

• By considering multiple assembly processes, each of which depends on the result 
of the previous step. The sequentiality is obtained by means of the final catalysis 
transitions. An example is provided in sec. |6l 

• By considering an internal chemical network in which the enzyme reacts with 
multiple substrates. This means that the internal network encompasses several 
assembly stages. In this case, the final catalysis basically corresponds to the 
assembly of the first substrate in the queue, while the following ones are still being 
processed. The correlations between successive assembly stages are thus embedded 
in the internal chemical network. 

The second kind of correlations appears to be similar to that introduced by 
J.J. Hopfield in [28]. In this scheme, when the enzyme proofreads the substrate, 
it accesses to a lower free-energy enzyme state. From this lower energy state, the 
reaction continues without proofreading, but when the enzyme catalyses the final 
reaction, the high-energy state is restored. Indeed, the final reaction is supposed to 
provide the enzyme with the energy required by this restoration (hence the name “relay 
proofreading”). Finally, an effective reduction of the error rate can be achieved without 
increasing the energetic discrimination constant by properly tuning the rate constants. 

Once formalized the basic assumptions about the kinetics of our system, we need 
to describe how the enzyme can discriminate the substrates. Since it undergoes 
the same kind of reactions with any substrate, the structure of reaction network 
leading from the free enzyme state to the final product state will be the same. In 
other words, the network topology does not discriminate. According to [16], the 
substrates discrimination is embedded in the rate constants. Indeed, the substrates 
are distinguished either because of differences of chemical complex binding energies 
(the typical example is DNA replication and RNA transcription, in which there is an 
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affinity difference between matching and mismatching nucleotide pairs), or because the 
activation energies related to the reaction involving the right substrate are lower than for 
the other ones (hgure [2](left), describing the energy landscape of the Michaelis-Menten 
reaction scheme, can help in visualizing these ideas). Following [16], we will refer to 
the hrst type of discrimination as energetic while the second one as kinetic. It is worth 
noting that all of these differences appear in the rate constants. 

The rates of the hnal catalysis transitions will not be written in Kramers form, due 
to their irreversible nature. We shall however assume that these rates are the same for all 
the substrates, since there is no substrate discrimination in this step. This assumption 
relies on the fact that most often the region of the enzyme involved in the discrimination 
is located far from that involved in the hnal binding, cf. p. 224]. 

The stochastic description of the process is described by a Master Equation [25] . 
[30] Ch. 5]. Denoting by Pi{t) the probability of Ending the chemical complex in the 
state i at time t, we can write it as 

[hjPjit) - kjiPiit )], ( 2 ) 

3 

where kij denotes the rate constant regulating the probability rate of the reaction leading 
from the state j to the state i. 

As shown in [25] and [SO] Ch. 5], this kind of processes evolves towards a unique 
steady state in which the probabilities are constant. Since we are not interested in 
the complete time evolution of these processes, but only in the steady-state regime of 
operation, we shall not describe in the following the transient initial stochastic dynamics. 
We shall instead directly focus on the steady state, which can be evaluated by either 
solving the linear set of equation related to the master equation ([2]) or by means 
of graphical methods [211 § 6], [25l §III]. Given the steady-state probabilities (pi), 
which are functions of the rate constants, we can evaluate all the relevant kinetic 
and thermodynamic quantities. Specihcally, for any given reaction scheme, the kinetic 
quantities we are interested in are described below, while the thermodynamic ones are 
discussed in the next paragraph. 


The error rate: 




rate of wrong catalysis: 


(3) 


total rate of catalysis: Jw + «^r ’ 
which describes the average rate of wrong outcome observed; 

The mean step time: 

T = -, (4) 

total rate of catalysis: + Jr' 

describing the average time needed to complete a step (this dehnition follows from 
that of mean hrst passage time of absorption in the hnal product state [211 Ch. 8] 
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At a first glance, the thermodynamic description of proofreading processes as described 
so far is complicated by the presence of the irreversible catalysis. However, it is clear 
that these transitions merely describe the snccession of the enzymatic assembly steps 
and do not take part in the discrimination itself. Thus we shall not take into account 
these steps in the expression of the entropy production and entropy flow. Following 
[21 ESI 132] the entropy production and entropy flow in the process are respectively 
given by 


[kjjPj - kjiPi] In 




probability flux 


kjiPi 
affinity 






‘ji 


(5) 

( 6 ) 


The primed sums run over all pairs of states, but exclude the hnal catalysis transition. 
Note that we consider the entropy flow as positive if it enters the system. The time 
derivative of the system’s Gibbs entropy is given by 

- ^'Y^Pi{t)lnpi{t) = Si +Se-Sc, (7) 


where the last term takes into account the contribution to the entropy change due to 
the catalysis transitions. More precisely. Sc can be viewed as the entropy change in 
the environment (hence the minus sign) due to the completion of the assembly. At the 
steady state, the expression on the left-hand side vanishes, but the single terms on the 
right-hand side do not vanish in general. By multiplying the entropy rates in the steady 
state by the mean-step time r (H we obtain respectively: 


The entropy production per step: 

AiS = rSi 


( 8 ) 


which is proportional to the amount of free energy irreversibly lost during a step; 

The entropy flow per step: 

AcS = rSc (9) 

which is the entropy reversibly exchanged with the reservoirs during a step. Let us 
highlight that in all the reaction schemes we describe, the process evolves from the 
free-enzyme state to lower-energy bound-substrate states. Thus these processes are 
exothermic and A^S is negative. 

The entropy change per step: 

= rSc ( 10 ) 


which is the entropy change in the environment due to the completion of just one 
step. Since assembly mechanisms bind free monomers into W and R hnal products. 
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thus decreasing their available phase space, A^S is negative. Furthermore, the 
fewer errors occur, the more ordered is the dual state, and the smaller is A^S. The 
explicit expression of this term depends on the chemical network topology. In sec. El 
we provide it for assembly processes devoid of correlations and with two competing 
substrates (eq. EB]) . 


The above-described quantities suggest us to dehne the efficiency of the 
proofreading process in the following way 


Ac5 . ^ AiF 
V = -ww; = 1 + 


AeS ^ ' Ae^' 

Since both AcS and AgS are negative and AiS is positive, this expression yields a value 
lying between 0 and 1. (An analogous dehnition of efficiency can be found in [33], 
applied to the process of information erasing in fermionic bits.) 

We shall exploit this thermodynamic description in the following sections. In 
particular, we will recover the thermodynamic trade-offs obtained in copolymerization 
models i uni El [16], and we will evaluate the corresponding efficiency. Finally, 
we highlight the following two points: (i) the splitting of the network into internal 
and external steps resembles the splitting into observable and masked steps, whose 
thermodynamics has been analysed in [35]; (ii) even if we had taken into account the 
irreversible catalysis path in the thermodynamic description, its contribution to the 
entropy production would have been hnite 


3. The Michaelis-Menten model 


Here we summarize the minimal reaction scheme describing assembly mechanisms, which 
follows the Michaelis-Menten (MM) enzymatic kinetics [29] sec. 4.5] [2], £g. |2](left). 

In this simple reaction scheme, the enzyme, in its free enzyme state &, reacts 
with one of the two substrates, r or w, thus forming the activated complex, &r or &w, 
respectively. Subsequently, it concludes the assembly step by means of the hnal catalysis 
transition. The MM reaction scheme provides us with a starting point for introducing 
the kinetic and thermodynamic description of the discrimination process. The present 
analysis essentially reproduces that of [16], in a form that can be easily extended to 
more complicated networks. 

Together with the reaction scheme shown in £g. |2](left), we consider the free-energy 
landscape in hg. |2](right). Given this energetic scheme and using the Kramers form of 
the rate constants we can write the reaction rates in the following form 

kf = kf = uie^, 

K = k- = uK, 


( 12 ) 


where oj is the overall time scale of the reaction pathway and e denotes the driving force 
acting on it. The discrimination constants, 7 and d, quantify the difference of binding 
free energy between the correct and the wrong activated complex and the difference of 
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Figure 2. Michaelis-Menten (MM) reaction scheme (left) and enthalpy landscape 
(right) for enzyme-assisted assembly processes involving two competing substrates. As 
mentioned in sec. [ 2 l the catalysis transitions are directly and irreversibly connected to 
the free-enzyme state. The driving energy, e, represents the enthalpy difference between 
the free enzyme state and the right product. The difference in binding energies between 
the wrong and right activated complex, 7 , provides the energetic discrimination, 
whereas the difference of activation energies, S, provides the kinetic discrimination. Let 
us emphasize that our representations of the free enthalpy landscapes simply provide a 
visual aid for understanding the models, and are not to be interpreted as experimentally 
reproducible curves. 


activation free energy between the correct and wrong free-enzyme-to-activated-complex 
reaction, respectively. 

The master equation for this system reads 

Ph= {F + k~) pw + (i^ + K) Pr - (^w + kf ) p&, 

Pw = A:+p& - {F + k~) pw, (13) 

Pr = kfpg, - {F + kf)pr, 

which, solved in the steady-state regime, leads to the following expressions for the 
probabilities: 

P& = — [F'^ + F (^k^ + ^w) + ^w] 1 

P^ = ^kf{F + kf), (14) 

Pr = ^kf {F + k-) . 

Here we have dehned the normalization factor S by 

S = -b F{k^ + kf + + ^w) + (^w F kf) + kfk,^. (15) 


Following the dehnition given in ([3]), and the Kramers form for the rate 
constants (na, we express the error rate as follows: 

Fpw + F 


^ Fpy, + Fp, (e'l' 4-1) + {e^ + 1) F' 

Analogously, the mean step time dehned in (jl]) as 

1 

T = 


Fpw -b Fp, ’ 


(16) 


(17) 
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while the entropy production per step and the efficiency are obtained by a simple 
application of ([ 8 ]) and flTT]) . Finally, the entropy change per step is expressed by 




Pr+P^ 


- , Pr . ^ 
Pr In — + Pw 

P&c 


In 


Ph 


(18) 


In order to determine the trade-offs between the error rate and the other kinetic and 
thermodynamic quantities, we observe that the former (eq. [T 6 l) is a monotonic function 
of the catalytic rate, F. We can thus invert ^{F) and substitute the result into the 
expression of the mean step time, entropy production per step and efficiency. In this 
way, all the thermodynamic quantities are expressed in terms of the error rate, which 
becomes an independent variable. The time-error, dissipation-error and efficiency-error 
trade-offs are shown in fig. |3l 

In the energetic discrimination regime [IB] {'1 > d), f monotonically increases for 
increasing catalytic rate. The minimum error rate is thus achieved in a quasistatic 
condition, i.e., vanishing catalysis, F —)■ 0. In this limit, the mean step time diverges, 
the dissipation vanishes and the efficiency reaches unity. The green curves in fig. [3] 
describe the trade-offs in the purely energetic discrimination regime and exhibits the 
above-described behavior close to the minimum error rate. On the contrary, in the 
kinetic discrimination regime [16] (5 > 7 ), ^ is monotonically decreasing with increasing 
F. In this case, the faster the process, the more discriminating it becomes, but this is 
accompanied by a larger dissipation needed to keep the process out of equilibrium. This 
discrimination regime is described by the blue curves in fig. |3l The smallest achievable 
error rate depends on the discrimination regime and has the following expression: 

C . — __ ^p-max{(5,7} /igi. 


3max{5,7} _j_ ^ 


Since only the maximum between the 7 and d contributes to the minimum error rate, 
there is no way of enhancing the discriminatory power of the model by combining the 
discrimination factors. 

The time-error and dissipation-error trade-offs are the same of those obtained 
by copolymerization models [16]. However, the definition of efficiency allows us to 
shed new light on this processes. Indeed, the efficiency-error trade-off in the kinetic 
discrimination regime exhibits a maximum in correspondence to the relative minimum 
of the dissipation. As the kinetic discrimination constant increases this maximum tends 
to one, as shown in fig. [3]^d). 


4. The Ninio-Hopfield model 

In the simple MM scheme, all the dissipated energy is spent to drive the process faster 
and not to proofread the outcome. However, this acceleration enhances the accuracy of 
the assembly only in the kinetic discrimination regime. In order to overstep the error 
rate thresholds obtained with the MM one needs a chemical mechanism in which the 
chemical energy provided by the environment is exploited to proofread the outcome, 
enhancing the pre-existing discrimination. 
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(a) (b) 





Figure 3. Mean step time (a), dissipation per step (b) and efficiency (c) versus 
the error rate in the MM model for different discrimination regimes. The reported 
graphs correspond to the following values for the coefficients: 7 = 3, e = 10, w = 1 
and d = 0 (green, purely energetic discrimination), <5 = 6 (light blue, mainly kinetic 
discrimination), <5 = 12 (dark blue, mainly kinetic discrimination), (d) Efficiency- 
error trade-off in the purely kinetic regime of discrimination. The coefficients values 
are: 7 = 3, e = 10, w = 1 and d = 6 (light purple), 6 = 8 (purple), <5 = 10 (dark 
purple). Interestingly, the efficiency exhibits a relative maximum in kinetic regime of 
discrimination and the maximum value tends to one as 6 increases. 


This can be realized by properly coupling the internal network directly with a 
chemical force. The first reaction scheme embedding this idea was independently 
proposed by J. Ninio [1] and by J. J. Hopfield [2]. We shall describe this model in the 
version proposed by Hopfield and shown in fig.[5l Two forces are coupled to the cycles of 
the network (one for each substrate) and drive the chemical complex preferentially in the 
direction & —)■ &s* ^ &s —?■ &. Indeed, with reference to the energy landscape shown 
in fig. |l](bottom) the system energy drops of an amount equal to ^ = e + ei + Cp when a 
cycle is performed. We are assuming that the chemical forces are equal for any substrate 
and the related affinity A is the same. The above-mentioned energy scheme traces the 
original Hopfield idea, but it is also provided with the kinetic discrimination scheme 
introduced in [8] and successively developed in [16]. According to Hopfield’s idea, the 
first and the proofreading reactions discriminate. On the one hand, the discrimination 
can be kinetic with discrimination constants denoted by 6 and (5p, for the first and 
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k . - ks* . - &s —^ S 



Figure 4. (top) Hopfield model reaction scheme; s denotes the generic substrate, 
i.e., s G {r, w}. (bottom) Free-energy diagram for the Hopfield model. The thick 
continuous line in (b) represents the Gibbs free energy potential along the reaction 
coordinate relative to the first two reactions: k ks* and ks* ks. The dashed 
line corresponds to the proofreading pathway. The energy difference between two 
consecutive free-enzyme states {A = e + + Cp) is the free energy consumed when 

a cycle & ks* ks —& is performed. The discrimination is performed along 
the first and/or the proofreading reaction, where 7 , S and 5p are the discrimination 
constants. 


the proofreading reaction, respectively. On the other hand the discrimination can also 
be energetic if, e.g., the binding energy of the wrong pre-catalysis state, &w, is larger 
than the corresponding one for the right pre-catalysis state—here 7 again denotes the 
difference of these energies. In this way, the wrong substrate is more likely to be rejected 
through the proofreading pathway. 

This picture can be represented by the rate constants, which are given by 
equation flT^ for the hrst reaction, and by 

hf=Uie"f hf=Ui, Kf = ujpe~^\ Kf = 

K = h~ =uji, Kf; = ujp, K~ = Upe^p^'^. 

for the second and the proofreading reactions. The related steady state can be evaluated, 
and the relevant kinetic and thermodynamic quantities derived. Finally, inverting the 
error rate as a function of the catalytic rate we obtain the trade-offs, shown in hg. |5l 
In our thermodynamic description, the relevant parameters are the discrimination 
constants ( 7 , 5 ,5p), since they determine the operation regime of the proofreading 
mechanisms (a detailed discussion on the role of the other constants is discussed 
















efficiency, 7 mean step time, 


Kinetic proofreading 


13 


(a) (b) 



10 -® 10 -^ 0.01 10 -* 10 -* 0.01 
error rate, ^ error rate, ^ 


(c) 



error rate, ^ 

Figure 5. Mean step time (a), dissipation per step (b), and efficiency (c) versus 
the error-rate for the Hopfield model. We have chosen the following values for the 
coefficients: e = 10, a; = 1, 7 = 3 and (i5,5p) = (0,0) (green curve: purely 
energetic discrimination), (5, (5p) = ( 6 , 0 ) (light blue curve: kinetic discrimination 
on the first chemical pathway), (i5,5p) = ( 0 , 6 ) (blue curve: kinetic discrimination 
on the proofreading chemical pathway), {S,dp) = ( 6 , 6 ) (dark blue curve: kinetic 
discrimination on both chemical pathways). The other constant, namely wi, Wp, ei, 
Cp, are those which minimize the error-rate function In this way, the minimum 
achievable error rate is recovered by the trade-offs. They have been obtained by 
numerically minimizing the error-rate function given the discrimination constants and 
the driving energy e, and are consistent with the values predicted by Hopfield in [5] 
The dashed orange curve refers to the equilibrium error rate for 7 = 3. Therefore, 
in the Hopfield model one consistently obtains smaller error rates than in the MM 
model for equal values of the discrimination constants. The light blue and blue curves 
highlight that kinetic and energetic discrimination regimes yield a resulting lower error 
rate only when they cooperate in the proofreading pathway. Remarkably, when the 
kinetic discrimination predominates on the first pathway the process becomes faster, 
more dissipative and more efficient. Finally, it is worth noting that high efficiency 
at lower error rates appears as a general trait of the kinetic discrimination for simple 
models, like the MM and the Hopfield ones. 
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in 13). With respect to the MM scheme, the working regimes of the Hopfield model 
are characterized by lower error rates on equal values of the discrimination constants. 
As already discussed by Hopheld, in the purely energetic discrimination regime the 
minimum achievable error rate equals the square of that obtained by the MM model 
(i.e., the equilibrium value) for the same values of the discrimination constants. When 
also the kinetic discrimination is taken into account the lowest achievable error rate 
becomes 

It is worth noting that thanks to the proofreading reaction pathway, the discrimination 
regimes can be partially mixed to further reduce the error rate, fig. |5l Indeed, although 
the discrimination factors of the first reaction pathway do not mix to reduce the error 
rate, those of the proofreading reaction do (blue curves in fig. [5]). The apparent 
contradiction with the results of [El is due to a different design of the free enthalpy 
landscape. The presence of the kinetic factor also speeds up the assembly at the price of 
a larger free-energy consumption (light blue curve in fig. [5]). Interestingly, the process 
becomes more efficient, too (fig. [5t). 

Let us point out that the minimum error rate is always achieved in the vanishing 
catalysis rate limit. In this regime all the free energy provided to the system is spent 
to proofread at the expense of the process speed. Hence, both the mean step time and 
the dissipation diverge approaching the minimum error rate. 

5. The Murugan-Huse-Leibler (MHL) Model 

The idea of embedding the network with more forces in order to reduce the error rate 
has been pursued in [T3], where a general scheme involving many reaction cycles was 
introduced. The authors of this work were able to show that the minimal error rate (with 
given value of the energetic discrimination constant 7 ) depends on the network topology, 
i.e., on the number of cycles appearing in the reaction graph. This class of models was 
further explored in a more general setting in [15]. In particular, the “railroad” scheme 
shown in £g. | 6 ]was introduced, as the most efficient one among a large class of reactions. 
We shall refer to this model as the MHL (Murugan-Huse-Leibler) model. Any substrate 
chemical network is coupled with N chemical forces of equal affinity A. These forces 
act on the N cycles of type ys„ —)■ Xs„ —)■ ?/s„+i —t 2 /s„ and drive the chemical 

complex towards the final product state on the x-chain of reactions and towards the 
free enzyme state on the ^-chain of reactions. The Xs^ Hsi reactions discriminate the 
substrates either kinetically or energetically. The discrimination constants, denoted by 
7 and (5, and will be taken to be for any of the -|- 1 pathways Xj. Vsi- 

We will focus our analysis to the behavior of the chemical network for varying 
number of independent forces, and thus of cycles. For this purpose we neglect the 
discrimination performed on the hrst stage & ys^ and assume that the related rate 
constants are the same for both substrates: kf = kf = ue’" and k~ = k~ = 00 . 
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Figure 6. MHL (“railroad”) model reaction scheme for the generic substrate s. 


Thus the rate constants related to the ladder part of the network are expressed by 
u, = dr = uy, /+ = Wf, /■ = Wfe""', 

tiw = Wue''“, d^ = ujue^, b'^ = Uh, 6“ = 

We can evaluate the error rate, the mean step rate and the efficiency by solving the 
steady-state dynamics, inverting the related error-rate function, and substituting the 
result in the relevant expressions, obtaining the results shown in fig. [71 

The relevant parameters are again the discrimination constants. In order to simplify 
the analysis, we discuss the energetic and kinetic discrimination regimes separately. The 
trade-offs in the energetic regime, taking into account different numbers of cycles, are 
shown in the figures [7] (a) and (b). In agreement with [H], the minimum error rate 
decreases exponentially as the number of cycles increases (^min (w+i) 7 ^_ However, 
when the number of cycles increases, both the mean step time and the dissipation per 
step increase, for equal values of the discrimination constants and error rate. This also 
plays a role on the efficiency, which accordingly decreases, fig. Etc). 

An analogous scenario appears in kinetic discrimination regime, whose dissipation- 
error trade-off is shown in fig.jTJd). The network needs at least two cycles to achieve error 
rates lower than that of the MM model, for equal values of the constants (^min ~ 

We thus infer that the most effective proofreading action is thus the result of the optimal 
combination of the network topology and discrimination regime. 

6. Michaelis-Menten model with correlations between consecutive steps 

As mentioned in sec. |2l in all the enzyme-assisted assembly schemes described so far 
all steps are similar to and independent from one another. In this section we take into 
account a simple scheme in which consecutive steps are correlated. 

We assume that a wrong step affects the system discriminatory power, i.e., that 
when the last step produces the wrong outcome, the discrimination constants decrease. 
We introduce this simple correlation scheme in the MM model and obtain the chemical 
network shown in fig. [HI Here, we consider two parallel reaction schemes: the first 
describes the steps following an error (denoted by the label w on the enzyme state, 
&w), while the second describes the steps following a correctly terminated step (denoted 
by the label r on the enzyme state, &r)- Finally, the connection between these two 
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(a) 


(b) 




(c) (d) 




Figure 7. Mean step time (a), dissipation per step (b), and efficiency (c) versus the 
error rate for the MHL model in the purely energetic discrimination regime (7 = 8 and 
(5 = 0). The plot (d), instead, shows the dissipation per step for the same model in the 
purely kinetic discrimination regime (7 = 0 and 6 = 3). The curves are distinguished 
by the number of cycles in the network: light green and light purple curves, = 1; 
green and purple curves, N = 2; dark green and dark purple curves, N = 3. The curves 
in (a-c) correspond to the following values of the parameters: 7 = 3, Cu = 8, ef = 8 
and Cb = 8, which represent the driving energy related to the reactions j/s,, 

Xsi —>■ Xsi^i and Usi+i —t Vsi, respectively. The other constant, Wu, Wf, Wb, are those 
which minimize the minimal error rate ^min, when the other parameters are kept fixed. 
The first two plots in (a) and (b) highlight the lowering of the error rate as the number 
of forces increase. However, both the mean step time and the entropy production 
increase and the system exhibits a progressively lower efficiency (c). Interestingly, in 
the kinetic regime (d), the system needs at least two forces in order to reduce the 
error rate. Let us observe that the model embedding just one cycle is very similar 
to Hopfield scheme, except for the non-discriminating pathways & ^ ys^. We thus 
suppose that these last pathways, detaching the free enzyme state from the part of the 
network performing the proofreading, spoil the discrimination in the kinetic regime. 
Finally, for the plot (d) the numeric constants chosen are: w = 1, e = 10, (5 = 3. The 
values of the other coefficients are those which minimize the minimal error rate ^min, 
and are given by Cu = 8, Cf = 9, Cb = 9, Wu = 0.1, uj{ = 10, Wb = 10). 


reaction networks is realized by the catalysis transitions. In this way, when the enzyme 
catalyzes the final reaction of the wrong substrate, the system comes back to the “wrong” 
free enzyme state. Instead, when the enzyme catalyzes the final reaction of the right 
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Figure 8. Michaelis-Menten (MM) model with correlations between consecutive 
steps. 


substrate, the system comes back to the “right” free enzyme state. As for the MM 
model, we assume that the catalysis transition does not discriminate and thus that the 
catalytic rate F is the same for the four possible outcomes. 

The type of correlation described above is implemented in the rate constants. For 
both networks they have the same expressions as in flT^ but with different discrimination 
constants. Following our hypothesis, the discrimination constants related to the wrong 
network, 7 ^ and 6^, are smaller than those for the right network, 7 r and 

We can solve the master equation in the steady-state regime and evaluate all the 
relevant quantities. The error rate and the mean step time are expressed by 

P(w)w T P{r)vj 




T(w)w + T(w)r + P{r)vi + P{r)r 
1 


(23) 


r = 


F (p(w)w + T(w)r + T(r)w + T(r)r) • 

The thermodynamic quantities can be expressed as in ([ 8 ]) and iUBi, taking care of 
excluding the catalysis transitions from the sums. However, we cannot express the error 
rate as a function of the catalytic rate in the general case (with four discrimination 
parameters). We will thus limit our analysis to the cases of purely energetic and purely 
kinetic discrimination, in which this expression is possible. 

Typical trade-offs are plotted in hg. |9] together with the trade-offs obtained by the 
MM with discrimination constants equal to 7 w and 7 r, and with 5^ and 6^. While the 
expression for the minimum achievable error rates, are given by 

e7w _|_ 1 / 1 _|_ g7w' 


C^min 

^min 


g7r+7w -|- 2e^'" -f 1 

e' 




gi5r+i5w _|_ 2 e'^'" -t -1 


e7w 

1 -|- e”^" 


.-7r 




( 24 ) 


in the energetic and kinetic regimes, respectively. 
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(a) (b) 





Figure 9. The light green and light purple lines represent the mean step time (a, 
c), and entropy production (b, d), as a function of the error rate for the MM model 
with correlations between consecutive steps, (a) and (b) correspond to the energetic 
discrimination regime 7 w = 2 , 7 r = 5, while (c) and (d) correspond to the kinetic one 
(5w = 2, (5r = 5. The curves are compared with the trade-offs obtained with the MM 
model (darker lines) for the same values of e and w, but taking into account separately 
the discrimination constants used for the model with correlations. We have chosen the 
following values for the driving force and the overall rate scale: e = 10 and uj = 1 (let 
us remind that these values are by assumption common for both the “right” and the 
“wrong” network). Remarkably, the reduction of the network discriminatory power is 
small. Indeed, the trade-offs we obtain are close to those obtained in the MM model 
by considering just the discrimination constants on the “right” pathway, namely the 
darkest curves (with 7 = 7 r and 5 = Jr). 


It is clear from this result that the discrimination loss due to a wrong outcome 
only slightly influences the global behavior of the kinetics. Indeed, for high values of 7 r 
and the difference between the nearest-neighbor and the simple MM model becomes 
negligible. 

Let us observe that when proofreading mechanisms are taken into account, like in 
Hopheld model, the probability that the step ends with a wrong incorporation decreases 
with respect to the MM model. Thus we expect that in the Hopheld model with nearest- 
neighbor interactions, the discrimination loss due to a wrong monomer incorporation 
would inhuence its global behavior even less than in the MM model. We conclude by 
remarking that these results are not in contradiction with the effective role of correlations 
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in biological systems. As already mentioned, in template-assisted assembly processes, 
correlations play a role in the individuation and proofreading of error, i.e., they can 
influence the internal rate constants when interaction with multiple substrates is taken 
into account. 

7. Conclusions 

Natural biological processes, like the enzyme assisted assembly processes that we have 
considered, must satisfy several, sometimes contradictory, requirements. The error rate 
should be lower than a threshold, the average duration of the process should not be 
too long, and the free-energy consumption should not be too large. It is likely that 
the detailed mechanisms of naturally occurring processes have evolved to optimize some 
combination of these quantities, whose expression depends on the requirements of the 
process itself. The method discussed in the present work, and the measure of efficiency 
that we have derived, can be useful in the evaluation of the performance of these 
mechanisms. 

We have been able to investigate a situation in which successive completed processes 
are not fully independent. In our situation, we have found that the correlations do not 
signihcantly modify the performance of the mechanism. We can however envisage more 
general situations, in which hidden correlations show up in the system performance. 
We have studied in a simple case the interplay of kinetic and energetic discrimination 
in many-cycle proofreading mechanisms, generalizing the results of [HIIS]. We have 
identihed a regime in simple proofreading models in which the introduction of a kinetic 
discrimination step improves both the speed and the efficiency of the process for a given 
discrimination rate. We can speculate that naturally occurring mechanisms work in a 
regime that optimizes some combination of these quantities—the particular combination 
depending on the details of the role of the process in the global cellular biochemical 
network. 

We can envisage extending the research to more general models, in which also 
the external chemical species can occur with varying concentration, driving the system 
out of equilibrium in several modes, as suggested, e.g., in |23]. This may lead to the 
understanding of more complex proofreading mechanisms, like those that take place in 
metabolic processes (cf. |37j). Indeed, the recent advances in the theory of the interplay 
of information and thermodynamics in non-equilibrium processes can help us in reaching 
a more precise understanding on the constraint obeyed by these fundamental processes 
in living systems. 
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